DOMAIN DATABASE KNOWLEDGE Incompleteness Noise
نویسندگان
چکیده
There are several di erent ways data mining the automatic induction of knowledge from data can be applied to the problem of natural language processing In the past data mining techniques have mainly been used in linguistic engineering applications to solve knowledge acquisition bottlenecks In this paper we show that they can also assist in linguistic theory formation by providing a new tool for the evaluation of linguistic hypotheses for the extraction of rules from corpora and for the discovery of useful linguistic categories Applying Quinlan s C inductive machine learning method to a particular linguistic task diminutive formation in Dutch we show that data mining techniques can be used i to test linguistic hypotheses about this process and ii to discover interesting linguistic rules and categories
منابع مشابه
Planning in Incomplete Domains
Engineering complete planning domain descriptions is often very costly because of human-error or lack of domain knowledge. While many have studied knowledge acquisition, relatively few have studied the synthesis of plans when the domain model is incomplete (i.e., actions have incomplete preconditions or effects). Prior work has evaluated the correctness of plans synthesized by disregarding such...
متن کاملDOMAIN DATABASE KNOWLEDGE Incompleteness
There are several diierent ways data mining (the automatic induction of knowledge from data) can be applied to the problem of natural language processing. In the past, data mining techniques have mainly been used in linguistic engineering applications to solve knowledge acquisition bottlenecks. In this paper, we show that they can also assist in linguistic theory formation by providing a new to...
متن کاملA FOIL-Like Method for Learning under Incompleteness and Vagueness
Incompleteness and vagueness are inherent properties of knowledge in several real world domains and are particularly pervading in those domains where entities could be better described in natural language. In order to deal with incomplete and vague structured knowledge, several fuzzy extensions of Description Logics (DLs) have been proposed in the literature. In this paper, we present a novel F...
متن کاملUtilizing Goal-Directed Data Mining For Incompleteness Repair In Knowledge Bases
In this paper we present a methodology for goal-directed data mining of association rules and incorporation of these rules into a probabilistic knowledge base. The purpose of the data mining and rule extraction process is to repair knowledge base incompleteness uncovered during validation. We discuss how this incompleteness is uncovered and show the fundamental forms this incompleteness can tak...
متن کاملKnowledge Discovery in the Prediction of Bankruptcy
Knowledge discovery in databases (KDD) is the process of discovering interesting knowledge from large amounts of data. However, real-world datasets have problems such as incompleteness, redundancy, inconsistency, noise, etc. All these problems affect the performance of data mining algorithms. Thus, preprocessing techniques are essential in allowing knowledge to be extracted from data. This work...
متن کامل